Are Initial/Final Units Acoustically Accurate?
Authors
Abstract
We present a comparative study of subword unit segmentation of Mandarin speech data. Most HMM recognition systems use initials/finals as subword units for Mandarin speech. We find that such a division of monosyllable data into initial/final units is not always supported by acoustic evidence. We implement a delta-MFCC-based segmentation method and compare its output with that of Viterbi segmentation based on initial/final units. We find that whenever the two outputs differ, the acoustic-based method outperforms the initial/final-based method. This raises the question of whether we should use initials/finals as subword units for training HMMs in large-vocabulary Mandarin speech recognition.
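The abstract does not spell out the delta-MFCC segmentation procedure, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes the MFCCs of one monosyllable are already available as a (frames, coefficients) array, uses the common regression formula for delta features, and places a single boundary at the frame where the delta magnitude peaks. The names `delta`, `split_syllable`, and `width` are hypothetical.

```python
import numpy as np


def delta(features, width=2):
    """Regression-based delta features for a (frames, coeffs) array.

    Assumption: the usual formula
        d[t] = sum_k k * (c[t+k] - c[t-k]) / (2 * sum_k k^2),  k = 1..width,
    with the first and last frames repeated at the edges.
    """
    feats = np.asarray(features, dtype=float)
    n_frames = feats.shape[0]
    # Repeat edge frames so that every frame has `width` neighbours on each side.
    padded = np.pad(feats, ((width, width), (0, 0)), mode="edge")
    denom = 2.0 * sum(k * k for k in range(1, width + 1))
    out = np.zeros_like(feats)
    for k in range(1, width + 1):
        ahead = padded[width + k : width + k + n_frames]   # c[t+k]
        behind = padded[width - k : width - k + n_frames]  # c[t-k]
        out += k * (ahead - behind)
    return out / denom


def split_syllable(features, width=2):
    """Return the frame index where the delta magnitude is largest.

    Assumption: one boundary per monosyllable, placed at the point of
    fastest spectral change. On ties the earliest frame is returned.
    """
    change = np.linalg.norm(delta(features, width), axis=1)
    return int(np.argmax(change))
```

A boundary found this way could then be compared, frame by frame, with the initial/final boundary produced by a Viterbi forced alignment of the same syllable.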
Similar resources
Automatic segmentation and clustering of speech using sparse coding and metaheuristic search
We propose a constrained shift and scale invariant sparse coding model for the purpose of unsupervised segmentation and clustering of speech into acoustically relevant sub-word units for automatic speech recognition. We introduce a novel local search algorithm that iteratively improves the acoustic relevance of the automatically-determined sub-word units from a random initialization by repeated...
Accuracy of perceptually based and acoustically based inspiratory loci in reading.
Investigations of speech often involve the identification of inspiratory loci in continuous recordings of speech. The present study investigates the accuracy of perceptually determined and acoustically determined inspiratory loci. While wearing a circumferentially vented mask connected to a pneumotach, 16 participants read two passages. The perceptually determined and acoustically determined in...
Mapping from sound to meaning: reduced lexical activation in Broca's aphasics.
Recent studies of lexical access in Broca's aphasics suggest that lexical activation levels are reduced in these patients. The present study compared the performance of Broca's aphasics with that of normal subjects in an auditory semantic priming paradigm. Lexical decision times were measured in response to word targets preceded by an intact semantically related prime word ("cat"-"dog"), by a r...
Destruction of Recombinant Tissue Plasminogen Activator (rtPA)-Loaded Echogenic Liposomes under Dual Frequency Sonication
Background: Echogenic liposomes (ELIPs) encapsulate drugs and gas bubbles within lipid vesicles. The destruction of ELIPs in response to MHz and kHz ultrasound waves has been studied previously. Applying ultrasound above a certain threshold causes encapsulated gas bubbles to destruct rapidly by fragmentation or more slowly by acoustically driven diffusion. This study compares the d...
Speech recognition based on acoustically derived segment units
This paper describes a new method of word model generation based on acoustically derived segment units (henceforth ASUs). An ASU-based approach has the advantages of growing out of human pre-determined phonemes and of consistently generating acoustic units by using the maximum likelihood (ML) criterion. The former advantage is effective when it is difficult to map acoustics to a phone such as with...
Publication date: 2007